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For any Bell locality scenario (or Kochen-Specker noncontextuality scenario), the joint Shannon 
entropies of local (or noncontextual) models dehne a convex cone for which the non-trivial facets are 
tight entropic Bell (or contextuality) inequalities. In this paper we explore this entropic approach 
and derive tight entropic inequalities for various scenarios. One advantage of entropic inequalities 
is that they easily adapt to situations like bilocality scenarios, which have additional independence 
requirements that are non-linear on the level of probabilities, but linear on the level of entropies. 
Another advantage is that, despite the nonlinearity, taking detection inefficiencies into account 
turns out to be very simple. When joint measurements are conducted by a single detector only, the 
detector efficiency for witnessing quantum contextuality can be arbitrarily low. 

PACS numbers: 03.67.-a, 03.67.Mn, 42.50.-p 
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I. INTRODUCTION 

Quantum mechanics predicts that experiments per- 
formed by space-like separated and independent ob- 
servers may display nonlocal correlations, which cannot 
be explained solely by past interactions. The assumption 
that physical quantities have well established values pre- 
vious to any measurement and that signals cannot travel 
faster than the speed of light, as stipulated by special 
relativity, entails limits on the correlations the observers 
may obtain. Such restrictions, usually expressed as Bell 
inequalities [1], may be surpassed within quantum the- 
ory when the observers share entangled quantum states, 
and it is in this sense that the quantum correlations are 
nonlocal. 

Similarly, noncontextuality is a classically well-defined 
property of mutually compatible observables. Two ob- 
scrvables A and B are mutually compatible if the result 
for the measurement of A, even if not performed, does 
not depend on the prior or simultaneous measurement 
of B and vice versa. The notion of noncontextuality is 
precisely captured by the Kochen-Specker (KS) theorem 
[2] , stating that no noncontextual hidden variable model 
(NCHV) can reproduce the results of quantum mechan- 
ics. Interestingly, as opposed to Bell's theorem, the KS 
theorem is not state-dependent and it holds for any phys- 
ical system (composite or not) with a state space of di- 
mension higher than two. 

Focusing on the Bell scenario, once the particular sce- 
nario is defined — the number of spatially separated par- 
ties, the number of measurements settings for each party 
and the number of outcomes for each setting — the as- 
sociated local (realistic) models form a convex set with 
a finite number of extremal points, an object known as 
the local polytope [3]. The tight Bell inequalities are the 
non-trivial facets bounding the local polytope. Given this 
geometric picture, a nonlocal model is a point outside the 
local polytope, or, equivalently, a point which violates a 
Bell inequality. 

It is clear that to properly understand nonlocality, it 
should be considered from as many aspects as possible. In 



the information-theoretic approach introduced by Braun- 
stein and Caves [4], it was shown that if local realism 
holds, then the joint Shannon entropies carried by the 
measurements on two distant systems must satisfy cer- 
tain inequalities, which can be regarded as entropic Bell 
inequalities. One advantage of this entropic approach 
is that the inequalities do not depend on the number 
of outcomes of the measured observables, which implies 
that they can readily be applied to quantum systems of 
arbitrary local dimension and to the consideration of de- 
tection inefficiencies [5]. 



In this paper, our aim is to further develop this en- 
tropic program to Bell inequalities and also introduce 
entropic inequalities to the study of (quantum and post- 
quantum) contextuality. As we discuss in Sec. II and in 
more detail in [6], the standard information-theoretic in- 
equalities for joint Shannon entropies (monotonicity and 
submodularity) define a convex cone [7] whose projection 
to the joint entropies of jointly measurable observables is 
another convex cone, whose facets correspond to all the 
optimal Shannon-type entropic Bell inequalities. 



From this geometrical point of view, we investigate 
the family of chained entropic inequalities derived in 
Ref. [4] — which we have shown to be the tightest en- 
tropic inequalities in the appropriate scenarios [6] — and 
look for violations in quantum and post-quantum proba- 
bilistic theories. In particular, we show how an entropic 
contextuality inequality can be violated — for joint mea- 
surements of the commuting observables — even for arbi- 
trarily low detection efficiencies. Moreover, in Sec. Ill we 
analyze a possible conection between the entropic CHSH 
inequality and nonlocality distillation [8-13]. For a bilo- 
cality scenario [14] (a Bell scenario with two indepedent 
sources which allow entanglement swapping [15]), we de- 
rive in Sec. IV all the relevant tight (Shannon-type) en- 
tropic inequalities. 
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FIG. 1: (Color online) Some contextuality and nonlocality 
scenarios; the number of outcomes of each observable is ar- 
bitrary, a. CHSH scenario, 2 parties with 2 measurements 
settings each b. Klyachko scenario, 5 observables arranged in 
a cyclic configuration such that each observable is compatible 
with its neighbors c. Generalization of the Klyachko scenario, 
with n observables in a cyclic configuration (the n-cycle [16]). 
The unique non-trivial entropic inequality in all these cases is 
given by (3). 



II. MARGINAL SCENARIOS AND ENTROPIC 
INEQUALITIES 

In the following, we collect the basic definitions and 
results concerning entropic inequalities. For a more de- 
tailed discussion, we refer to Ref. [6]. 

Marginal scenarios. The concept of marginal sce- 
nario subsumes both Bell scenarios and contextuality 
scenarios. A marginal scenario is defined by specifying a 
set of observables X% , . . . , X n for which certain subsets 
are known to be compatible and can be jointly measured. 
If a subset of these observables can be jointly measured, 
then so can any smaller subset; therefore, the collec- 
tion of subsets of jointly measurable observables should 
be closed under taking smaller subsets. These are the 
marginal scenarios which we have discussed in [6]; see 
there for more background and references on the marginal 
problem. The measurement covers of [17] capture pre- 
cisely the same idea in a slightly different way. 

Definition 1. A marginal scenario M is a collection 
M. = {Si, . . . , 5*|^vi|} of subsets Si C {X±, . . . ,X n } such 
that if S G M and S' C S, then also S' E M. 

In this sense, every Bell scenario is a marginal sce- 
nario [17, Sec. 2.4.1]: the collection of observables 
X\ , , . , , X n should comprise all observables of all par- 
ties, and a subset of these observables is jointly mea- 
surable if it does not contain two different observables 
of the same party. For example in the bipartite case, 
with Alice having access to observables Aq, . . . ,A m _i 
and Bob to Bq, . . . , B m -i; if we write Xi = and 
Xi+m = Bi-i for i = 1, . . . , m, then the set of all observ- 
ables is {X\, . . . , X2m\, and the subsets of jointly measur- 
able observables are the empty subset, the one-observable 
subsets, and the two-observable subsets {Xi,Xj} where 
i < m and j > m. 

In a physical realization of a marginal scenario M. , one 
measures some joint statistics for every S G M. This 
means that one assigns a joint probability distribution to 



every jointly measurable set of observables. We use no- 
tation like P(l, 0\X 3 , X 5 ) for the probability of obtaining 
the outcomes X3 = 1 and X$ = in a joint measurement 
of X3 and X$ (assuming that {X3,X$} G M). 

If S G M. and S' C S, then one can take the marginal 
of the distribution assigned to S and obtain a distribution 
over the outcomes of the observables in S'. Naturally, 
this marginalized distribution should be the one assigned 
to S'. Requiring this property leads to the marginal mod- 
els of [6], or, equivalcntly, to the sheaf condition and the 
empirical models of [17]. In the case of Bell scenarios, the 
marginal models are precisely the no-signaling boxes. 

Noncontextual hidden variables. We now define when 
a marginal model is contextual. The most intuitive the- 
ories of physics are those where there exists a certain 
"hidden" variable A, distributed according to probabili- 
ties g(X) > with J^x @W = 1j sucn that A determines 
the complete future behavior of the system. Here, com- 
pleteness means that the distribution P(x\Xi, A) of any 
observable Xi, given a certain value of A, should be in- 
dependent of the outcome distributions of all other ob- 
servables. This implies that when Xi and Xj are jointly 
measurable, then their outcome distribution is given by 



P(x i ,x j \X i ,X J ) = ^2 e (X)P(x i \X i ,X)P{x j \X j ,X), (1) 



and similarly for cases where more than two observables 
are jointly measured. If there exist conditional distri- 
butions P(xi\Xi,X) and a hidden variable distribution 
g(X) such that (1) holds for all jointly measurable pairs 
{Xi,Xj} G M and more generally for all S G A4, then 
we say that we have found a noncontextual hidden vari- 
able model, and the given marginal model P is noncon- 
textual; otherwise P is called contextual. In the case of 
Bell scenarios, the noncontextual hidden variable models 
are precisely the local hidden variable models, in which 
case we also use the standard terminology of "local" and 
"nonlocal" . 

Following [16, Thm. 6] or [17, Thm. 8.1], we note that 
the noncontextuality of P is equivalent to the existence 
of a joint distribution 

P{xx, . . .,x n \Xi, . . .,X n ) = p(xi, ...,x n ) (2) 

which marginalizes to the given distributions for all S G 
M. 

The main question is: how is it possible to decide 
whether a given marginal model P in a marginal scenario 
M. is contextual or noncontextual? 

Entropic inequalities. From a joint probability distri- 
bution (2), one can define the associated Shannon en- 
tropy 



H{X x ...X n ) 
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More generally, marginalizing the joint distribution to 
any subset S € M. of the observables gives a joint entropy 
H(X$), where we write X$ for the tuple of observables 
(Xi)ies- This joint entropy H(Xs) is also defined in any 
marginal model, since the distribution of X$ is known 
for Se M. 

As has first been noticed in [4] and as we have de- 
veloped more formally in a general framework [6], the 
noncontextuality of P, i.e. the existence of a joint distri- 
bution (2), implies that the H(Xg), for S € M, satisfy 
certain inequalities which may be violated in some con- 
textual models. 

Definition 2. An cntropic contextuality inequality is a 
linear inequality in the H(Xs) for S £ M. which is satis- 
fied whenever P is noncontextual. In the special case of 
a Bell scenario, we use the term entropic Bell inequality. 

If some marginal model violates a certain entropic con- 
textuality inequality, then this inequality has witnessed 
the contextuality of the marginal model. 

In Ref. [6], we have classified the entropic contextual- 
ity inequalities in the n-cycle marginal scenarios. This 
family of scenarios is defined by starting with any num- 
ber n > 3 of observables X\ , . . . , X n and assuming that 
Xi and Xi+\ are pairwise jointly measurable for all 
i = 1 , . . . , n, where we write X n+ \ = X\ for ease of no- 
tation. No other pairs of observables are assumed jointly 
measurable, and no triples of observables are assumed 
jointly measurable. For n = 4, Fig. la shows that this 
can be identified with the CHSH Bell scenario [18]. For 
n = 5 (see Fig. lb) it is the marginal scenario considered 
by Klyachko [19, 20], and hence wc call it the Klyachko 
scenario. For general n, it can be visualized as an n-sided 
polygon (Fig. lc). Our result in [6] is that the inequal- 
ities derived in [4] are a complete set of tight entropic 
inequalities in these scenarios. (An entropic inequality is 
tight when no other entropic inequality can be strictly 
better than this one, so that a complete set of tight en- 
tropic inequalities completely characterizes the region of 
noncontextual marginal models in entropy space.) Stated 
more formally: 

Theorem 3 ([6]). A marginal model in this scenario is 
entropically noncontextual if and only if the entropic in- 
equality 

H(X i X i+1 )+ J2 //::A\. • ^//;.\,.\, (3) 

holds for all i = 1, . . . , n. 

In principle, one may also want to consider inequal- 
ities containing derived entropic quantities like condi- 
tional entropies and mutual information. However, since 
these derived quantities are themselves nothing but linear 
combinations of joint entropies, every entropic inequality 
containing the former can be rewritten in terms of the 
latter. In fact, as there are no linear relations between 
joint entropies, every linear entropic inequality turns into 



a unique normal form when expressed in terms of joint 
entropies [21, Sec. 13.2]. 

Besides this, the relevance of our general framework [6] 
lies in the fact that the standard computational geometry 
methods like Fourier-Motzkin elimination, which have 
been extensively used to characterize tight Bell inequal- 
ities in probability space, can also be applied to derive 
tight (Shannon-type) entropic contextuality inequalities 
and tight (Shannon-type) entropic Bell inequalities. Un- 
fortunately, these computations are very demanding: ap- 
plying the methods described in [6] to the tripartite Bell 
scenario with two observables per party, we have not been 
able to fully characterize the tight (Shannon-type) en- 
tropic Bell inequalities in this scenario. 

In the following, we analyze the case n = 4 (the CHSH 
scenario) and the case n = 5 (the Klyachko scenario) in 
some more detail and analyze in particular their violation 
by marginals models arising from quantum theory. 

A. The CHSH scenario 

In the usual CHSH scenario [18], there are two distant 
parties A and B, each measuring one of two observables 
Aq,A\ and B$,Bi, respectively. Each of these observ- 
ables is taken to have two possible outcomes, so that the 
set of outcomes can be taken to be { — 1, +1}. As usual 
for Bell scenarios, we take observables to be jointly mea- 
surable when they belong to different parties. In our 
framework, this can be described by the marginal sce- 
nario where the collection of jointly measurable sets of 
observables is given by 

0, {A }, {A,}, {B }, {B 1 }, 

{A ,B }, {Ao,^}, {A lt B }, {A 1 ,B 1 }. 

This is illustrated in Fig. la. The CHSH inequality [18] 

CHSH 

(4) 

together with its equivalent variants is a necessary and 
sufficient condition for noncontextuality (i.e. Bell local- 
ity) in this scenario. 

By taking n — 4 and renaming the observables occur- 
ing in (3), we obtain the entropic inequality [4] 

H(A 1 B 1 ) + H{A ) + H(B ) 

< H(AqBq) + H{A B X ) + H(A 1 B ). ( ' 

In order to emphasize the similarity with (4), we can 
rewrite this in terms of mutual information as 

I(A : B ) + I(A Q : B±) + I{A 1 :B ) - I{A 1 : B x ) 

- H(A ) - H(B ) < 0. 

The last two terms in the left-hand side are analogous 
to the classical bound of 2 in (4). Wc abbreviate the 
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FIG. 2: (Color online) a. Violation profile of the entropic 
CHSH inequality for a pure state of the form cosa|00) + 
sina|ll). The violation profile essentially coincides with the 
one of the standard CHSH inequality, although the scales are 
different. The curve is obtained by optimizing over the ob- 
servables for each a. b. Maximal violation of the entropic 
inequality (3), achieved on a maximally entangled state of 
two qubits (a = tt/4) and k = 2, 10 measurement settings 
for each party. 



left-hand side by CHSH E , 
inequality is 



so that the entropic CHSH 



CHS H E < 0. 

Note that the entropic CHSH inequality does not only ap- 
ply to ±l-valued observables A x and B y \ it is completely 
irrelevant how many outcomes each observable has — as 
long as the Shannon entropies converge — and which out- 
comes these are. 

Quantum violations of CHSHe- We now assume 
that the two parties A and B share a pure entangled 
two-qubit state. For any such state, one can choose the 
local bases such that it has the form 



cosajOO) +sina|ll), 



(6) 



and By also for the ±1- 



for some a € (0, 7r/2). Writing A x 
valued quantum observables measured on | tp) , we obtain 
the joint distributions, written in terms of the standard 



notation of conditional probabilities, 
P(a,b\x,y) 



l + (-l)M* ^ \ + {-l) b By 



(7) 



For ease of later notation, we stipulate that the observ- 
ables A x and B v are ±l-valucd, but the outcomes a, b 
are {0, l}-valucd. 

As shown in [4], quantum correlations of the form (7) 
do indeed lead to violations of (5). These violations wit- 
ness the non-existence of local hidden variable models 
for (7). Numerical optimization shows that the maximal 
violation of (5) for quantum correlations of the above 
form is achieved when all measurement settings lie in the 
Y-Z plane of the Bloch sphere, that is, for measurement 
operators A x and B y of the form sm9-a y + cos 8 -a z . (One 
could as well take them all to lie in the X-Z plane; what is 
important is that they lie in the same plane.) The max- 
imal violation is obtained for the maximally entangled 
state a = 7r/4, on which one gets CHSHe ~ +0.237. 

For other values of a, the maximal violation of (5), 
when optimized over the measurements, follows the exact 
same profile as for the standard correlator inequality (4) 
as can be seen in Fig. 2a. However, the measurements 
that maximize the violation of CHSH are not the ones 
which give the maximal violation of CHSHe- In fact, 
we will see below that those choices of observables which 
produce the maximal CHSH value for a certain a do not 
violate (5). In general, for the standard CHSH scenario, 
the violation of the standard inequality (4) is a necessary 
but not sufficient condition for the violation of (5). 

We have also considered the inequalities (3) for any 
even n = 2k as entropic Bell inequalities as fol- 
lows. When A chooses between ±l-valued observables 
A\ , . . . , Ak and B among B\, . . . , Bk , then (3) becomes 
applicable upon taking X-2i = Bi and X21—1 = Aj. 
Fig. 2b shows our numerical results for the maximal vio- 
lation of (3) on a two-qubit state (6). 

No-signaling violations of CHSHe- In the CHSH 
scenario, there is a special class of marginal models 
known as isotropic boxes, which we would now like to 
study. To begin, the Popescu-Rohrlich box (PR box) [22] 
is defined to be the marginal model 



P™(a,b\x,y) = ^ ! + (-!) 



(8) 



It is the unique marginal model which maximally vio- 
lates (4). Similarly, the isotropic box with parameter 
C E [0, 1] is defined to be the marginal model 



P iso (a, b\x, y) = i [l + C (_i) a ® & ®^ 



(9) 



It corresponds to a probabilistic mixture of P PR with 
weight C and uniform white noise P w with weight 1 — C . 

Equivalently, an isotropic box can be characterized by 
having uniformly random marginals, that is 



(A x 



(B„ 



0, 



5 



together with 

(A Q B Q ) = (AqBi) = (A^Bq) = -(AxBx) > 0. 

The parameter C is determined from this by C = (AoBq) . 
Its relation to the CHSH value of the box is simply given 
by CHSH(P iso ) = 4C*. 

Any marginal model in the CHSH scenario can be 
transformed into an isotropic box through a local depo- 
larization process, keeping the CHSH value (4) invari- 
ant [23]. Therefore, for many purposes it is enough to 
consider isotropic boxes only. 

Interestingly, no isotropic box violates the entropic 
CHSH inequality. In particular, this applies to the PR- 
box, although it maximally violates the standard CHSH 
inequality. One can understand this by noting that en- 
tropy only probes the probability values occurring in a 
distribution, but not which probability values get as- 
signed to which outcomes. This means that the PR-box 
is, as far as the entropies are concerned, equivalent to the 
marginal model P c describing classical correlations, 



P c (a,b\x,y) 



1 



1 



■1) 



(10) 



Entropic quantities cannot distinguish between the per- 
fect anti-correlation of A\ and B\ as it appears in P PR , 
and the perfect correlation of A\ and B\ as in P c . All 
joint entropies of the PR-box coincide with those of P c . 

For two-outcome measurements, the maximal violation 
of (5) is +1 for the following reason: any marginal model 
with two-outcome measurements will satisfy H(A$) < 
H(A B a ) and H(B a ) < H{A 1 B Q )] similarly, H{A l B l ) < 
1 + H(Bi) < H(A Bi) + 1. Taking these inequalities 
together shows that CHSHe < 1 for any such marginal 
model in the CHSH scenario. This bound on the violation 
can indeed by achieved by the no-signaling box 



pmax 1 p 



PR 



1 pc 
2 r 



which is an equal mixture of the PR-box with classical 
correlations. P max can be understood as the probabilistic 
model in which each of the three pairs (Ao, Bq), (Aq,Bi) 
and (Ai,Bq) displays perfect correlation, while the fourth 
pair (Ai,Bi) is uncorrelated; see also [6, Prop. 4.3]. Note 
that P max achieves a value of 3 on CHSH, and therefore 
does not have a quantum-mechanical realization since it 
is beyond Tsirelson's bound of 2y/2. 

This example shows that a convex combination of two 
non-violating marginal models may violate an entropic 
contcxtuality inequality. This highlights the strongly 
non-linear character of entropic inequalities (see also 
Fig. 4). 

Discussion. It is a basic feature of Shannon entropy 
that the entropy of a probability distribution is invari- 
ant under permutations of the sample space. From this 
point of view, we find it surprising that the entropic in- 
equality (5) can be violated at all. Violations of entropic 
contcxtuality inequalities witness a very particular kind 



of contcxtuality: if a probabilistic model violates an en- 
tropic inequality, then so does every other probabilistic 
model obtained by permuting the outcome probabilities 
of a joint measurement, provided that the permuted joint 
distribution has the same marginals. For example, this 
leads to the phenomenon observed above that the PR- 
box P PR is entropically indistinguishable from classical 
correlation P c . 

Along similar lines, the only symmetry operations that 
can be applied in order to transform an entropic contex- 
tuality inequality into an equivalent one are permuta- 
tions of the observables which map jointly measurable 
sets to jointly measurable sets. In the case of a Bell sce- 
nario, these permutations are either permutations of the 
parties, permutations of the observables of some party, 
or arbitrary combinations thereof. Rclabclings of the 
outcomes of an observable do not change the inequality, 
again due to the fact that entropies arc invariant under 
outcome permutations. 



B. The Klyachko scenario 

A very simple state-dependent proof of the Kochen- 
Specker theorem with only five two-outcome observables 
was given by Klyachko et al. in [19, 20]. The marginal 
scenario in this case is the one depicted in figure lb: there 
are five ±1- valued observables Xi, X 2 , A3, X4, A5 such 
that Xi and Xi + i (modulo 5) arc compatible. 

The so-called Klyachko inequality is a necessary and 
sufficient condition for noncontextuality in this sce- 
nario [19, 20]. It is given by 



^(A. t A, 



> 



11 



Due to Theorem 3, the only non-trivial entropic in- 
equality in the Klyachko scenario is given by, up to cyclic 
permutations of the observables, the entropic Klyachko 
inequality 

H(XiX 6 ) + H(X 2 ) + H(X 3 ) + H(X 4 ) 

- HiXxXi) - H{X 2 X 3 ) - H{X 3 X 4 ) - H(X 4 X 5 ) < 

(12) 

To investigate the quantum violations of the entropic 
Klyachko inequality, we choose two-outcome observables 
on C 3 of the form 



Xi = 2\vi)(vi\-1 
with the vectors \vi) given by 

\vi) = (0,0,1) 

1^2) = (sin 0, cos#, 0) 

1^3) = A/" -1 (cos 9 sin <fi, — sin 9 sin </>, sin 9 cos < 
\vi) = (0, cos(j), sm.(j)) 
\v 5 } = (1,0,0), 



(13) 
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(ft ~\} 

l U ! l ) 


(~\ ft") (ft ft) 


-Xii Xb 




4(f> 2 1 - 8(/) 2 


X\,X2 


l-5<f 


4^ 2 <j> 2 


X2,Xz 




l-5^ 2 |0 2 



by 



TABLE I: Joint outcome probabilities for the analytic viola- 
tion. All values are up to 0{<f> 3 ). 



where the 



norma lization factor is M = 
\J sin 2 9 + cos 2 9 sin 2 (j>. Up to choice of basis and 
multiplying the \vi) by irrelevant phases, every config- 
uration of 5 unit vectors \v\) , . . . ,\v 5 ) G M 3 with \vi) 
orthogonal to is of this form. 

Since each \vi) is orthogonal to the observable 

Xi commutes with -Xj+i, so that these two obscrvables 
are compatible and we can talk about their joint measure- 
ment. Also thanks to orthogonality, their joint outcome 
{Xi = 1, = 1) never occurs. 

We write \v{ x Uj+i) for a unit vector orthogonal to 
both \vi) and 

Upon measuring these observables on some initial state 
\ip) € C 3 , the non- vanishing joint outcome probabilities 
are given by 



P(0,l\X i ,X i+1 ) = P(l\X i+1 ) = |(^ + i|^)| 2 

p(i,o\x u x i+1 ) = p^Xi) = \{ Vi \i>}\ 2 

P(0,0\X h X i+1 ) = 1 - P(l\Xt) - P(l\X i+1 ) 
= \(vi x u l+1 \^)\ 2 . 



(14) 



Numerical results. Numerical calculations show that 
the maximal qutrit violation of (12) with observables Xi 
occurs on a qutrit state of the form 

1 

(sin a, cos a, sin a) (15) 



yl + sin 2 a 



with a ps 0.29736 and 9 = (f> ps 0.24131, for which the 
left-hand side of (12) is ps +0.091. 

Analytical proof of quantum violations. We would like 
to present an analytic proof showing that quantum vio- 
lations of (12) occur with the Xi and \tf>) of the above 
form. We set 9 = <f> and a = 2<f> and expand everything 
for (f> <C 1. Then, by symmetry, the joint outcome distri- 
bution of X4 and AT5 coincides with the one of X2 and 
X\] likewise, the joint outcome distribution of X 3 and 
Xi coincides with the one of X 3 and X 2 ] the remaining 
joint distributions, up to 0(<f) 3 ), are listed in Table I. 

With this symmetry, (12) is equivalent to 



2H(X 2 ) + 
2H(X 1 X 2 [ 



H(X 3 ) 

- 2H(X 2 X 3 ) < 0. 



(16) 



H(X 1 ) = 


-40 2 log(0 2 )H 


-O(0 2 ) 


H(X 2 ) = 


-50 2 log(0 2 )H 


-O(0 2 ) 


H(X 3 ) = 


-^log(0 2 )- 


f o(4> 2 ) 


H{XxX 2 ) = 


-50 2 log(</> 2 )H 


-0(<f> 2 ) 


H{X 2 X 3 ) = 


-50 2 log(0 2 )H 


-O(0 2 ) 


H(X 1 X 5 ) = 


-80 2 logW> 2 )H 


-O(0 2 ) 



With this, the left-hand side of (16) is -|^ 2 log(^ 2 ) + 
0{4> 2 ), which is positive for small enough <f>. 

Detection inefficiencies: single- detector model. One 
can take advantage of the fact that entropic inequalities 
can handle any finite number of outcomes and use the 
same approach to investigate the more realistic case with 
detection inefficiencies. We will consider two possible sce- 
narios: one where compatible observables are measured 
jointly (one detector), and one where compatible observ- 
ables are measured sequentially (two detectors). 

In the single-detector model with detection efficiency 
77 £ [0, 1], there is an additional outcome (0, 0) which rep- 
resents the no-click event of the detector for each jointly 
measurable pair (Xi,X i+ i). The new outcome probabil- 
ities P n are given by 



P r, (x i ,x i+ - i \X i , X l+ i) = riP(xi,Xi + i\Xi,X i+1 ) 



(17) 



where the measurments are now { — 1, +1, 0}-valued, and 
Xi,Xi+\ € { — 1,+1} are the "proper" outcomes. In this 
model, a no-detection event always occurs for both ob- 
servables simultaneously 

The joint probabilities (17) marginalize to the single- 
observable distributions 



(18) 



The corresponding relevant entropy values are given 



P(x i \X i )= V P(x i \X i ) 
P{9\Xi) = l-rj. 

Proposition 4. With this model of inefficiencies, 

H r '{X l )=i 1 H{X l ) + h{i 1 ) 
H'"(X t X l+1 ) = r)H{XiX i+1 ) + h{rj). 



where h(r]) = —rjlogrj — (1 — 77) log (1 — 77) is the binary 
entropy. 

Proof. This follows from an application of the grouping 
rule of Shannon entropy, see e.g. [24, Sec. 2.179]. □ 

Upon plugging these equations into (12), one finds that 
the contributions of h(rj) cancel, so that the left-hand side 
simply scales as a linear function of 77. Therefore, the 
entropic contextuality inequality (12) has violations for 
any 77 > 0. Moreover, the maximal violation with qutrit 
measurements of the form (12), (14) is given by the same 
state and vectors which maximizes the violation for 
7/ = (which arc a ps 0.29736 and 9 = <f> ps 0.24131). 



Detection inefficiencies: two- detector model. We now 
assume that the joint measurement of Xi and Aj + i is re- 
alized by one detector measuring Xi and another detector 
measuring Xi+\. Again, we take each detector to have 
an efficiency of 77 € [0, 1], for simplicity the same value 
for all 5 detectors, such that the no-click event of the first 
detector is independent of the no-click event of the sec- 
ond detector. A physical situtation leading to this kind 
of model may be a sequential scheme where the system 
passes through a non-demolition measurement in the first 
detector before reaching the second detector. 

Consequently, the jointly measurable pair (Xi, Xi+i) 
has an outcome distribution, with Xi G {— 1, +1}, 



P v (x l ,x l+1 \X u X l+ i) = i] 2 P(x l ,x l+1 \X ll X i+ i) 

P 1 >(x i ,9\X i ,X i+1 ) = (l-n)r)P(xi\Xi) 
P"(0,x l+1 |A l ,A m ) = (l-j 7 )7 ? P(a ;i+1 |X i+1 ) 
P^(%,$\X l ,X l+1 ) = (l-r 1 ) 2 



(19) 



which again marginalize to the single-observable distri- 
butions (18). 



Proposition 5. With this model of inefficiencies, 



(20) 



H^iXi) =nH{X t ) + h( V ) 
H"{XiX i+1 ) =r/ 2 H(X t X i+1 ) 

+ 7,(l-?7) [H(Xi)+H{X i+1 )]+h(v). 

where h{rj) = —77 log?/ — (1 — 77) log(l — 77) is the binary 
entropy. 

Proof. Again, this follows from an application of the 
grouping rule of Shannon entropy [24, Sec. 2.179]. □ 

Due to the additional terms in (20), the required detec- 
tion efficiency for witnessing quantum violations in the 
two-detector model turns out be very high, 77 rj 0.995. 



III. NONLOCALITY DISTILLATION AND THE 
ENTROPIC CHSH INEQUALITY 

As we have seen in the previous sections, finding vi- 
olations of an cntropic contextuality inequality, for ex- 
ample of the entropic CHSH inequality (5), is not easy, 
and there are few quantum-mechanical models which do 
violate them. Therefore, we regard the violation of an en- 
tropic contextuality inequality (entropic Bell inequality) 
as a witness of a very strong form of contextuality (nonlo- 
cality). So what does the violation of an entropic contex- 
tuality inequality tell us about the violating model? Can 
the violation be regarded as a resource for something? 

Since Shannon entropy is an asymptotic quantity 
which measures the effective size of a probability distri- 
bution on the level of many copies (see for example the 
Asymptotic Equipartition Property [21]), we also expect 
any answers to these questions to be concerned with the 
limit of many copies. 



x,(x)- 
a,. 
x 2 (x,a,) - 
a 2 . 



■ Yi(y) 
-b, 

■y 2 (y.b 2 ) 

-b 2 



a(x,a„a 2 ) 



b(y,b„b 2 ) 



FIG. 3: (Color online) General nonlocality distillation proto- 
col with two boxes. The inputs for the second box may locally 
depend on the output of the first one and the final output of 
the combined boxes is a function of all inputs and outputs. 



In this section, we would like to consider the case of 
the CHSH scenario and investigate a bit, on a purely phe- 
nomenological level, one particular property which also 
has an asymptotic flavor: the property of nonlocality dis- 
tillation. However, we barely have a precise hypothesis 
to offer — let alone a proof — and the similarities we will 
describe in the following may easily turn out to be su- 
perficial. 

Considering the CHSH scenario, it was recently shown 
that nonlocality can be distilled [9-13]: by locally pro- 
cessing several copies of certain bipartite no-signaling 
boxes, one can increase the amount of nonlocality ac- 
cording to a particular nonlocality measure. In the fol- 
lowing, we consider bipartite no-signaling boxes with bi- 
nary inputs and outputs. Each party can wire boxes to- 
gether using classical circuitry to produce a new binary- 
input /binary-output box. 

The general distillation protocol with two copies of a 
no-signaling box is displayed in Fig. 3. A certain protocol 
distills the nonlocality if the no-signaling box obtained af- 
ter the wiring is more nonlocal — according to a certain 
measure — than the original box. As a measure of non- 
locality, we will consider the nonlocal part of the EPR2 
decomposition [25], defined as follows. Any no-signaling 
box P can be decomposed into convex combinations of a 
purely local box P L and another box P NL , 



P = (1 - q)P L + qP 



NL 



(21) 



with some coefficient q € [0, 1]. The minimal coefficient q, 
when minimized over all such decompositions is the non- 
local content of P. By definition, computing the nonlocal 
content is a linear program. In the case of the CHSH sce- 
nario, there is a linear relationship between the nonlocal 
content and the violation of the standard CHSH inequal- 
ity [13]; therefore, in the present context we may regard 
the CHSH value as our measure of nonlocality. 

We now focus on two particular wirings. The first one 
was proposed in [9] and is given by 



X\ = x, X2 = x, a = a\ 
Vi = V, V2 =y,b = bii 



3 a 2 
62, 



(22) 
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FIG. 4: (Color online) Slice of the no-signaling polytope cor- 
responding to the family of boxes (25). The quantum region 
and CHSHe violations are bounded respectively by the thin 
black and thick black curves. The regions of distillability 
by the wirings (22) and (23) correspond respectively to the 
dashed-dotted red and dashed blue lines. The region of vio- 
lations of the entropic CHSH inequality is strictly larger than 
the region of correlations distilable with the wiring (23). On 
the other hand, in a small part of the quantum region, non- 
locality can be distilled by the protocol (22) while not being 
detected by CHSHe- 



while the second one was proposed in [13] and is given 
by 



x\ = x, x 2 = x ® ax ® 1, a = a\ ® a 2 ® 1 
2/i = 1,2/2 = yb 1 ,b = b 1 ® b 2 ®l. 



As shown in [13], these two wirings completely character- 
ize the distilability in the CHSH scenario on the two-copy 
level. 

For example, for the wiring (22), the wired no-signaling 
box is given by 



FIG. 5: (Color online) Distilled nonlocality (increase in nonlo- 
cal content) of the family (27) upon application of the wiring 
(30) for d = 2,4 (red full line), d = 3 (blue large dashed line) 
and d = 5 (black dashed line) as computed by linear program- 
ming. The inset shows the violation of CHSHe, which does 
not depend on d. 



For 7 = 0, this is a family of boxes with CHSH = 0. 
The family (25) forms a triangle in the no-signaling poly- 
tope extending from the boundary of the local polytope 
up to the PR-box. As displayed in Fig. 4, the subset of 
boxes violating CHSHe and the subset of boxes distil- 
lable via (23) or (22) are very similar. We expect that 
by going beyond the two-copy level and considering all 
possible wiring protocols, the subset of distillablc boxes 
will enlarge. It seems reasonable to ask whether it will 
contain all boxes violating CHSHe- In general, could 
the violation of CHSHe be a sufficient condition for dis- 
tillability? 

Moreover, we have investigated the bipartite scenario 
with two d-outcomc obscrvablcs per party, to which 
CHSHe still applies. We have considered the family 
of no-signaling boxes 



P'(a,b\x,y)= P{ai,bi\x,y)P(a 2 ,b 2 \x,y), (24) where P™ is the generalized PR-box [26] 



ai+a2— a 
b 1 +b 2 =b 

As for any wiring protocol with two copies of the original 
box, the new box P 1 is a quadratic function of the original 
one. So the first (superficial) similarity between wirings 
and the entropic CHSH inequality is nonlinearity. 

Interestingly, in the case of isotropic boxes — which 
we found in II A not to violate the entropic CHSH 
inequality — distillation is not possible [8]. More gener- 
ally, we have considered the family of boxes 



Pf R (a,b\x,y) 



1/d 




if a — b = xy 
otherwise 



mod d 



while P^ is the classically correlated box 



P d c (a,&|x,y) 



1/d iia = b 
otherwise 



(27) 



(28) 



(29) 



7,? 



-fP PR +£P C + (l_ 7 _f)p/ 



(25) 



with parameters 7, £ G [0, 1], P PR and P c given respec- 
tively by (8) and (10), as well as 



It follows from the definition (29) and an application of 
the CGLMP inequality [27] that the nonlocal content of 

is simply £. On the other hand, CHSHe = — £ log£ — 
(1 — £) log 1 — £ turns out to be the binary entropy. 

To probe the nonlocality distillation of Pj, we have 
considered the wiring 



pf(a,b\x, y ) = ±[2+(-l) a ^ 



X\ = x, X2 = xa\ mod 2, a = (a\ + a 2 ) mode? 
Vi = V, V2 = yb\ mod 2,6= (bi + b 2 ) mod d 



(30) 



(26) 



which is P f = \P PR 
box and white noise. 



ip 1 ", half-way between the PR- 



which can be regarded as a generalization of the wiring 
proposed in Ref. [10]. Fig. 5 shows the increase of 
nonlocal content achievable with this protocol for any 
d = 2, . . . , 5 and the value of CHSHe', we observe qual- 
itatively identical behavior. 
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FIG. 6: (Color online) Swapping experiment scenario, where 
two independent entangled states are prepared and sent to 
three distant parts. If the central part B measures in a en- 
tangled basis, the subsystems A and C, that had no previous 
contact, become nonlocally correlated. 



IV. ENTROPIC BILOCALITY INEQUALITIES 

Bilocality scenarios. In entanglement swapping [15], 
there are two sources of entangled quantum states. The 
first source (on the left in Fig. 6) sends half of its entan- 
gled state to A and the other half to B; the second source 
(on the right in Fig. 6) sends half to B and half to C. If B 
applies the right kind of entangled measurement between 
the two quantum systems which he receives, then, con- 
ditioned on an outcome of this measurement, the post- 
measurement state between A and C will be entangled. 

This idea has been used to obtain strong bounds [14] 
on the existence of local hidden variable models of entan- 
gled quantum states augmented by the assumption that 
the hidden variables Ai and A2 describing the two sources 
are probabilistically independent; this is the bilocality as- 
sumption. 

Under the assumption of local realism without the bilo- 
cality assumption, the conditional probabilities for the 
scenario of Fig. (6) would have the form 

P (a,b,c\x,y,z) 

= J28(M,X2)P(a\x,Xi)P(b\y,Xi,M)P(c\z,X 2 ) ( 31 ) 

Ai,A2 

where g(Xi,X 2 ) is the probability distribution over the 
pairs of hidden variables. As usual, {x, y, z} and {a, b, c} 
describe, respectively, the inputs and outputs at each lo- 
cal part. Since the two systems received by Bob are being 
jointly measured, they can be treated as a single entity. 

Since g can be chosen such that Ai = A2 occurs with 
probability 1, such a model is equivalent to one of the 
form 

P (a, b, c\x, y, z) 

= Y^Q (A) P(a\x, X)P(b\y, X)P(c\z, A), ( 32 ) 



which is the standard description of local realism in a 
three-party Bell scenario. 

The bilocality assumption in addition imposes inde- 
pendence of Ai and A2, so that the distribution g is 
required to factor as p(Ai,A2) = gi(Xi)g 2 (X 2 ), which 



means that a bilocal model is one of the form 
P(a,b,c\x,y,z) 

= Qi(*i)Q2(X2)P{a\x,X 1 )P(b\y,X 1 ,X 2 )P{c\z,X 2 ). 

(33) 



Ai ,Ao 



A direct calculation shows that every such model sat- 
isfies, besides the usual no-signaling equations, also the 
condition 

-P(a, b, c\x, y, z) = P(a\x)P(c\z) Va,c,x,y,z (34) 



where P(a\x) and P(c\z) are the marginal behaviors of 
A and C, respectively. 

However, there are no-signaling boxes P(a,b,c\x,y, z) 
which satisfy (34), but nevertheless are not bilocal, 
i.e. cannot be written in the form (33). Many exam- 
ples of this form are even local boxes of the form (32). 
We will soon see more concrete examples of this in which 
the "non-bilocality" can be witnessed by an entropic in- 
equality. 

Entropic bilocality inequalities. Due to the nonlinear- 
ity of the bilocality condition g(Xi,X 2 ) = gi(Xi)g 2 (X 2 ), 
the set of P(a 7 b, c\x, y, z) of the form (33) is not con- 
vex [14], and it is difficult to determine whether a given 
P lies in this bilocal set or not. This is where entropic 
inequalities enter: as we will show in the following, they 
give necessary requirements for P to be bilocal in terms 
of inequalities linear in the entropies of P. This lin- 
earity is already visible on the level of the two sources, 
whose probabilistic independence is equivalent to vanish- 
ing mutual information, I(Xi : X 2 ) = 0, which is a linear 
entropic equation 

H(X 1 X 2 ) = H(X 1 ) + H(X 2 ). 

For the sake of concreteness, we consider the specific 
scenario where A and C have two available measurement 
settings to choose from, whereas B always applies the 
same fixed measurement, so that x, z € {0, 1} and y = 0. 
This corresponds to 5 observables Aq,Ax,B,Cq,C\. A 
subset of these 5 observables is jointly measurable when- 
ever it contains at most one observable of A and at most 
one of B. Following [6], one can visualize the marginal 
scenario as in Fig. 7. In addition, the observables Aq 
and A\ are independent of the observables Co and C\\ 
therefore, in order to calculate the entropic inequalities 
for this scenario following the procedure of [6], we need 
to consider the independence constraint 

I(A A 1 : C d) = 0, (35) 

which we write out in terms of joint entropies as 

HiAoAiCoC!) = H{A a A- i ) + H(C d). (36) 

Note that the data processing inequality implies that the 
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FIG. 7: The simplicial complex of jointly measurable observ- 
ables in the bilocality scenario discussed in the main text. 
Two observables are jointly measurable whenever they share 
an edge; three observables are jointly measurable whenever 
they are the vertices of one of the four triangles. 



independence constraints 

I(A m :C o C 1 )=0, 

I(A A 1 : C z ) = 0, (37) 
I (Ac : C x ) = 0, 

follow from (35). 

We have used the computational approach of [6], aug- 
mented by the independence constraints (36) and (37) 
written out in terms of joint entropies, in order to 
calculate all the (Shannon-type) entropic inequalities 
in this bilocality scenario; although including the con- 
straints (37) would not have been strictly necessary, it 
helps in speeding up the computation. This computa- 
tion has resulted in 4 equations and 52 tight inequalities. 
The 4 equations are precisely the independence condi- 
tions I(A X : C z ) = 0. The 52 inequalities fall into 10 
symmetry classes which we have listed in Table II. The 
first four inequalities are trivial in the sense that they 
will hold for any no-signaling box P(a, b, c\x, y, z) satis- 
fying (34), while the other six inequalities can potentially 
be violated by non-bilocal boxes. 

Looking for quantum violations. We now consider the 
quantum case. Instead of sending out independent hid- 
den variables Ai and A2, the two sources now emit en- 
tangled quantum states. We take these to be given by 
generic partially entangled two-qubit states 

cos(9 fc |00) + sin0 fc e #fc |ll) 

with k = 1,2 indexing the two sources. Then A and 
C receive one qubit each, while B receives two. Upon 
choosing the two-qubit measurement of B to be in the 
Bell basis and numerically optimizing over all projective 
measurements for A and C, we have not been able to find 
any quantum violation of any of our 52 entropic bilocality 
inequalities. 

On the other hand, it is not difficult to design 
some general no-signaling boxes P(a, b, c\x, z) which sat- 
isfy (34), but violate some of our entropic inequalities. 
This applies for example to the family of boxes, for pa- 




0.0 0.2 0.4 0.6 0.8 

7 

FIG. 8: (Color online) Parameter space of the family (38). 
The red dot-dashed line delimits the region where (39) 
and (40), and therefore (38), are local as witnessed by the 
CHSH inequality. The region above the blue dashed line 
contains violations of the entropic bilocality inequality #7 
from Table II. The black line delimits the region where (39) 
and (40), and therefore (38), have a quantum realization with 
a single 3-qubit source. There is a small region (inset) in 
which our inequality detects that (38) is not bilocal, although 
the box is tripartite local. 



rameters £,76 [0, 1], 

P NB (a,b,c\x,z) (38) 

= 1 (l _|_ £^_]^affi&©c©xz *y)(— ]^<*ffl&©<^ 

This box can be understood as follows. The two out- 
comes b <E {0, 1} both occur with probability 1/2; if b = 0, 
then this creates between A and C the bipartite box 

zpp* + 7 p c + (1 - e - i)p w (39) 

and if b — 1 , then the resulting box between A and C is 

£P APR + 1 P Ac + (l-£-j)P w , (40) 

where P APR and P Ac stand for an "anti-PR-box", de- 
fined as in (9) with C = — 1, and classical anti- 
correlations, respectively. 

Depending on the value of £ and 7, the box P NB can 
be tripartite local, tripartite nonlocal but tripartite quan- 
tum, or post-quantumly tripartite nonlocal. P NB satis- 
fies (34) since the bipartite marginal between A and C is 
pure white noise, P NB {a 1 c\x, z) = P w {a 1 c\x, z) = 1/4. 

pNB v i i a tes some of our entropic bilocality inequali- 
ties. Wc focus on the inequality 7 from Table II. Fig. 8 
shows the region of violations as a function of the para- 
ramcters £ and 7. Interestingly, even in the region where 
P NB is local as a tripartite box, the entropic bilocal- 
ity inequality can be violated. This witnesses that in 
those parameter ranges, P NB cannot be written in the 
form (33), although the box is local and (34) is satisfied. 
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Entropic Bilocality Inequalities 



# 




H(B) 






H(BC Z ) 


H{A x BCz) 


X 1 z 1 XZ 
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1 
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00 01 10 11 
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1 -1 
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1 
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1 -1 


1 -1 


-10 
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1 





1 








0-1-11 


8 


1 
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-1 1 





-11 0-1 


9 


1 





1 


-1 1 


-1 1 


1-1-10 


10 











1 


1 


-1 -1 -1 1 



TABLE II: All classes of entropic bilocality inequalities. We have listed the coefficients of one inequality in each row, and all 
inequalities are of the form < 0. 



V. CONCLUSION 

In this work, we have exploited our general frame- 
work [6] for deriving entropic contextuality inequalities 
and entropic Bell inequalities. The standard methods of 
computational geometry (like Fouricr-Motzkin elimina- 
tion) , which have been applied widely to the computation 
of tight Bell inequalities, can be used for the computation 
of tight (Shannon-type) entropic contexuality inequali- 
ties (Bell inequalities). Following [17] and related work, 
our framework also treats nonlocality as a special case of 
contextuality. 

We also had shown in [6] that the family of chained 
entropic inequalities derived by Braustcin and Caves [4] 
are the only non-trivial facets in the appropriate scenario 
(Theorem 3). Given that, we have investigated quantum 
and more general violations of these inequalities both in 
the Bell scenario case and in the contextuality scenario 
case. Using a model of joint detection for compatible ob- 
servables, we noticed that quantum violations of a certain 
entropic contextuality inequality exist for any positive 
detection efficiency. Furthermore, we have fully charac- 
terized the entropic inequalities for the simplest bilocal- 
ity scenario. The entropic bilocality inequalities can be 
violated by general no-signalling correlations respecting 
the obvious condition (34), but no quantum violations 
could be found in terms of two independent sources of 
entangled quantum states. 

We have asked the question what the violation of an 
entropic contextuality inequality or entropic Bell inequal- 
ity might be a resource for. In the case of the CHSH 
scenario, we have approached this by noting some super- 
ficial similarities with the distillation of nonlocality. We 
have speculated that the violation of the entropic CHSH 
inequality may be a sufficient condition for the possibil- 
ity of distillation. If this would turn out to be true, it 
may be very useful, since in general it is very difficult to 
decide which no-signaling boxes are distillable and which 



ones are not. 

With our general framework, many more possibilities 
can be explored. For example, the principle of informa- 
tion causality (IC) [28], which has been proposed in or- 
der to understand the implausible consequences of super- 
quantum correlations, is also an entropic inequality. It 
can be shown that the inequality defining IC is, up to 
symmetries, the only non-trivial facet of an entropic cone 
defined by the IC scenario [29]. In principle, modifying 
the scenario will therefore let us derive many other IC- 
like principles as entropic inequalities, in particular some 
with a multipartite flavor. 

Since practical computations with entropic cones are 
extremely demanding, it would be helpful to have more 
efficient algorithms for the computation of facets of poly- 
hedral cones in order to characterize all (Shannon-type) 
entropic Bell inequalities for bipartite scenarios with 
more observables per party, multipartite Bell scenarios, 
and IC scenarios. To identify nonlinear problems that 
turn to be linear in terms of entropies, as we have done 
with bilocality, may also be an appealing line of future 
research. 



Postscript 

While finishing this paper, the work [30] has appeared, 
which also contains some of our results of section II B. 
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